Anomaly Detection in Cybersecurity Datasets via Cooperative Co-evolution-based Feature Selection

نویسندگان

چکیده

Anomaly detection from Big Cybersecurity Datasets is very important; however, this a challenging and computationally expensive task. Feature selection (FS) an approach to remove irrelevant redundant features select subset of features, which can improve the machine learning algorithms’ performance. In fact, FS effective preprocessing step anomaly techniques. This article’s main objective quantify accuracy scalability both supervised unsupervised effort, novel using FS, called Detection Using Selection (ADUFS), has been introduced. Experimental analysis was performed on five different benchmark cybersecurity datasets with without feature performance techniques were investigated. The experimental results indicate that instead original dataset, dataset reduced number yields better in terms true positive rate (TPR) false (FPR) than existing for detection. For example, technique, multilayer perception increased TPR by over 200% decreased FPR about 97% KDD99 dataset. Similarly, local outlier factor more 40% 15% 36% Windows 7 NSL-KDD datasets, respectively. addition, all require less computational time when suitable rather entire datasets. Furthermore, have compared six other state-of-the-art based decision tree (J48).

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Fast SFFS-Based Algorithm for Feature Selection in Biomedical Datasets

Biomedical datasets usually include a large number of features relative to the number of samples. However, some data dimensions may be less relevant or even irrelevant to the output class. Selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. To this end, this paper presents a hybrid method of filter and wr...

متن کامل

Multi-task Feature Selection based Anomaly Detection

Network anomaly detection is still a vibrant research area. As the fast growth of network bandwidth and the tremendous traffic on the network, there arises an extremely challengeable question: How to efficiently and accurately detect the anomaly on multiple traffic? In multi-task learning, the traffic consisting of flows at different time periods is considered as a task. Multiple tasks at diffe...

متن کامل

fast sffs-based algorithm for feature selection in biomedical datasets

biomedical datasets usually include a large number of features relative to the number of samples. however, some data dimensions may be less relevant or even irrelevant to the output class. selection of an optimal subset of features is critical, not only to reduce the processing cost but also to improve the classification results. to this end, this paper presents a hybrid method of filter and wr...

متن کامل

Feature-based anomaly detection

A feature-based approach for detecting anomalies in spectral, spatial, temporal, and other domains is described. When the frequency of occurrence is small relative to the background, anomalies such as man-made objects in natural image backgrounds do not form their own clusters, but are instead assigned the nearest background cluster, becoming an outlier (statistical anomaly) in that cluster. Ou...

متن کامل

Feature Selection for SVM-Based Vascular Anomaly Detection

This work explores feature selection to improve the performance in the vascular anomaly detection domain. Starting from a previously defined classification framework based on Support Vector Machines (SVM), we attempt to determine features that improve classification performance and to define guidelines for feature selection. Three different strategies were used in the feature selection stage, w...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: ACM transactions on management information systems

سال: 2022

ISSN: ['2158-656X', '2158-6578']

DOI: https://doi.org/10.1145/3495165